
Science China Life Sciences

Springer Science and Business Media LLC

Preprints posted in the last 7 days, ranked by how well they match Science China Life Sciences's content profile, based on 26 papers previously published here. The average preprint has a 0.08% match score for this journal, so anything above that is already an above-average fit.

1
A high-throughput Epstein-Barr virus nuclear antigen 1 (EBNA1) serology test strip for nasopharyngeal carcinoma risk screening

Warner, B. E.; Patel, J.; Satterwhite, R.; Wang, R.; Adams-Haduch, J.; Koh, W.-P.; Yuan, J.-M.; Shair, K. H. Y.

2026-04-13 infectious diseases 10.64898/2026.04.08.26350329 medRxiv
Top 2%
0.7%

Purpose: Antibodies to Epstein-Barr virus (EBV) proteins can predict nasopharyngeal carcinoma (NPC) risk. We previously defined a prototype EBNA1 protein panel and multiplex immunoblot assay that distinguishes NPC risk several years pre-diagnosis. Assay throughput and specificity are critical to effectively implement a population-level screening program. Here, we developed a strip test assay, EBNA1 SeroStrip-HT, with the objective of increasing throughput and maximizing specificity. Experimental Design: EBNA1 full-length (FL) and glycine-alanine repeat deletion mutants (dGAr) were purified from insect and mammalian cells to screen serum IgA/IgG from prospective cohorts in Singapore and Shanghai, China, with known time intervals to NPC diagnosis. Twenty pre-diagnostic sera within 4 years to diagnosis were compared to 96 healthy controls using a nested case-control study design. Results: IgA to mammalian-derived EBNA1 dGAr achieved 85.0% sensitivity and 94.8% specificity (AUC, 0.939) for NPC status. IgA to insect-derived EBNA1 dGAr showed the same sensitivity (85.0%) and similar specificity (93.8%) (AUC, 0.941). IgA to insect-derived EBNA1 FL had a higher sensitivity (90%) but a lower specificity (91.7%) (AUC, 0.940). Combining EBNA1 FL and dGAr results showed that subjects positive for both proteins had an odds ratio of 243.67 for NPC incidence compared to those with double-negative scores. Conclusion: This study demonstrated the efficacy of EBNA1 SeroStrip-HT for NPC risk assessment and stratification in high- and intermediate-risk populations, yielding high accuracy and a 12-fold increased throughput over the prototype. The insect system was appropriate for large-scale production of purified EBNA1. Larger, geographically diverse cohorts are warranted to confirm these results, especially in low-incidence populations.

2
VAE (Variational Autoencoder) Based Gastrotype Identification and Predictive Diagnosis of Helicobacter pylori Infection

Ma, Z.; Qiao, Y.

2026-04-13 gastroenterology 10.64898/2026.04.11.26350690 medRxiv
Top 3%
0.6%

Background: The enterotype concept proposed that gut microbiomes cluster into discrete types, but subsequent critiques demonstrated that such clustering depends on methodological choices, that the number of clusters is not fixed, and that faecal samples cannot capture spatial heterogeneity along the gastrointestinal tract. The stomach remains particularly understudied, and no systematic classification exists for gastric microbial community types. Methods: We assembled a multi-cohort dataset of 566 gastric mucosal samples spanning healthy controls to gastric cancer, with both Helicobacter pylori (HP)-negative and HP-positive individuals. Critically, we applied the key methodological lessons of the enterotype debate: we used a variational autoencoder (VAE) for dimensionality reduction to learn a continuous latent representation without forcing discrete structure, determined the optimal number of clusters using the Silhouette index (an absolute validation measure) across K=2 to K=10 rather than arbitrarily selecting a cluster number, and performed transparent evaluation of multiple clustering solutions. This VAE-plus-silhouette workflow directly addresses the critiques leveled against the original enterotype analysis. Results: Four gastrotypes were identified, with K=4 achieving the highest mean silhouette score, indicating good cluster cohesion and separation. Two gastrotypes (Variovorax-type and Trabulsiella-type) were significantly enriched in HP-positive samples, while two gastrotypes (Bacteroides-type and Streptococcus-type) were significantly enriched in HP-negative samples. Random Forest and Gradient Boosting achieved excellent baseline performance for predicting HP infection (AUC = 0.990 and 0.993). Conclusions: The VAE-plus-silhouette workflow provides a robust, data-driven approach for identifying gastrotypes without forcing discrete structure or arbitrarily fixing cluster numbers. Using this framework, we identified four gastrotypes with significantly different HP infection rates. Variovorax-type and Trabulsiella-type showed strong HP-positive enrichment, while Bacteroides-type and Streptococcus-type showed strong HP-negative enrichment. These findings demonstrate that methodological advances from the enterotype controversy can be successfully transferred to the stomach, offering a reproducible taxonomy for stratifying HP infection status with potential clinical utility.

3
Transmission dynamics of the COVID-19 pandemic across the emerging variants in mainland China: a hypergraph-based spatiotemporal modeling study

Wang, Y.; WANG, D.; Lau, Y. C.; Du, Z.; Cowling, B. J.; Zhao, Y.; Ali, S. T.

2026-04-17 public and global health 10.64898/2026.04.16.26351004 medRxiv
Top 4%
0.4%

Mainland China experienced multiple waves of the COVID-19 pandemic during 2020-2022, driven by emerging variants and changes in public health and social measures (PHSMs). We developed a hypergraph-based Susceptible-Vaccinated-Exposed-Infectious-Recovered-Susceptible (SVEIRS) model to reconstruct epidemic dynamics across 31 provinces, capturing transmission heterogeneity associated with clustered contacts. We assessed key characteristics of transmission at national and provincial levels during four outbreak periods: initial, localized pre-Delta, Delta, and widespread Omicron, which accounted for 96.7% of all infections. We found significant diversity in transmission contributions across cluster sizes, with a small fraction of larger clusters responsible for a disproportionate share of infections. Counterfactual analyses showed that reducing cluster-size heterogeneity, while holding overall exposure constant, could have lowered national infections by 11.70% to 30.79%, with the largest effects during the Omicron period. Ascertainment rates increased over time but remained spatially heterogeneous, ranging from 14.40% to 71.93%. Population susceptibility declined following mass vaccination (to 42.49% in Aug 2021, nationally) and rebounded (to 89.89% in Nov 2022) due to waning immunity, with variations across the provinces. Effective reproduction numbers displayed marked temporal and spatial variability, with higher estimates during Omicron. Overall, these results highlight the critical role of group contact heterogeneity in shaping epidemic dynamics.

4
Cross-cultural adaptation and psychometric validation of the ISBAR Structured Handover Observation Tool in ICU-to-ward patient transfer

Ni, N.; Zhao, B.; Wang, Y.; Wang, Q.; Ding, J.; Liu, T.

2026-04-14 nursing 10.64898/2026.04.10.26350669 medRxiv
Top 4%
0.3%

The ISBAR framework is used to standardize clinical handovers and enhance patient safety. Observational tools based on ISBAR have been developed to assess the completeness of information transfer. However, these instruments have primarily been developed in non-Chinese contexts, and validated Chinese-language observational tools suitable for clinical practice remain limited. In this study, a cross-cultural adaptation and psychometric validation of the ISBAR Structured Handover Observation Tool was conducted, examining its reliability and discriminant validity in Chinese clinical settings. The study was conducted in two phases: cross-cultural adaptation and psychometric evaluation in real-world clinical settings. Content validity was assessed using the Content Validity Index (CVI), and inter-rater reliability was evaluated using the Intraclass Correlation Coefficient (ICC) based on a two-way mixed-effects model with absolute agreement. Discriminant validity was examined using the Mann-Whitney U test to compare scores across nurses with varying levels of clinical experience. A total of 233 handover cases involving patient transfers from the intensive care unit (ICU) to general wards were collected, involving 84 nurses. The scale demonstrated good content validity, with item-level content validity indices (CVI) ranging from 0.88 to 1.00 and a scale-level CVI/Ave of 0.98. The inter-rater reliability, assessed using fifty randomly selected cases, was high, with an intraclass correlation coefficient (ICC) of 0.885 for single-rater assessments and 0.939 for average-rater assessments. Discriminant validity analysis showed that nurses with more clinical experience had significantly higher total scores than those with less experience (Z = -4.772, p < 0.001). The Chinese version of the ISBAR Structured Handover Observation Tool demonstrates good content validity, high inter-rater reliability, and acceptable discriminant validity.
This tool provides a standardized and practical method for assessing the completeness of information transfer and is expected to support quality improvement in patient handover from the ICU to general wards in Chinese clinical settings.

5
Efficient generation of epitope-targeted de novo antibodies with Germinal

Mille-Fragoso, L. S.; Driscoll, C. L.; Wang, J. N.; Dai, H.; Widatalla, T. M.; Zhang, J. L.; Zhang, X.; Rao, B.; Feng, L.; Hie, B. L.; Gao, X. J.

2026-04-15 synthetic biology 10.1101/2025.09.19.677421 medRxiv
Top 5%
0.3%

Obtaining novel antibodies against specific protein targets is a widely important yet experimentally laborious process. Meanwhile, computational methods for antibody design have been limited by low success rates that currently require resource-intensive screening. Here, we introduce Germinal, a broadly enabling generative pipeline that designs antibodies against specific epitopes with nanomolar binding affinities while requiring only low-n experimental testing. Our method co-optimizes antibody structure and sequence by integrating a structure predictor with an antibody-specific protein language model to perform de novo design of functional complementarity-determining regions (CDRs) onto a user-specified structural framework. When tested against four diverse protein targets, Germinal successfully designed functional antibodies across all targets and binder formats, testing only 43-101 designs for each antigen. Validated designs also exhibited robust expression in mammalian cells and high sequence and structural novelty. We provide open-source code and full computational and experimental protocols to facilitate wide adoption. Germinal represents a milestone in efficient, epitope-targeted de novo antibody design, with notable implications for the development of molecular tools and therapeutics.

6
De novo designed bifunctional proteins for targeted protein degradation

Mylemans, B.; Korona, B.; Acevedo-Jake, A. M.; MacRae, A.; Edwards, T. A.; Huang, D. T.; Wilson, A. J.; Itzhaki, L. S.; Woolfson, D. N.

2026-04-15 synthetic biology 10.64898/2025.12.22.695915 medRxiv
Top 5%
0.3%

Targeted protein degradation (TPD) is a therapeutic strategy to remove disease-causing proteins by routing them to the ubiquitin-proteasome, autophagy, or lysosome machineries. For instance, proteolysis-targeting chimeras (PROTACs) are synthetic hetero-bifunctional small molecules that simultaneously bind the target and an E3 ubiquitin ligase to drive ubiquitination and degradation by the proteasome. Despite considerable success, designing such molecules is challenging and the number of currently addressable ubiquitin E3 ligases is limited. Here we demonstrate hetero-bifunctional de novo designed proteins as alternatives for TPD to access more targets and ligases. First, we develop a stable and highly adaptable helix-turn-helix scaffold for presenting different binding sites. Next, we use computational protein design to incorporate and embellish hot-spot-binding sites to target BCL-xL, plus short linear motifs (SLiMs) for KLHL20 ligase recruitment. The resulting mono- and bi-functionalised proteins bind the targets in vitro, and the latter degrade BCL-xL in cells leading to apoptosis.

7
Deep-learning-Assisted Photoacoustic and Ultrasound Evaluation for Pre-transplant Human Liver Graft Quality and Transplant Suitability

Zhang, Q.; Tang, Q.; Vu, T.; Pandit, K.; Cui, Y.; Yan, F.; Wang, N.; Li, J.; Yao, A.; Menozzi, L.; Fung, K.-M.; Yu, Z.; Parrack, P.; Ali, W.; Liu, R.; Wang, C.; Liu, J.; Hostetler, C. A.; Milam, A. N.; Nave, B.; Squires, R. A.; Battula, N. R.; Pan, C.; Martins, P. N.; Yao, J.

2026-04-15 transplantation 10.64898/2026.04.13.26350786 medRxiv
Top 6%
0.2%

End-stage liver disease (ESLD) is one of the leading causes of death worldwide. Currently, the only curative option for patients with ESLD is liver transplantation. However, the demand for donor livers far exceeds the available supply, partly because many potentially viable livers are discarded following biopsy evaluation. While biopsy is the gold standard for assessing liver histological features related to graft quality and transplant suitability, it often leads to high discard rates due to its susceptibility to sampling errors and limited spatial coverage. Besides, biopsy is invasive, time-consuming, and unavailable in clinical facilities with limited resources. Here, we present an AI-assisted photoacoustic/ultrasound (PA/US) imaging framework for quantitative assessment of human donor liver graft quality and transplant suitability at the whole-organ scale. With multimodal volumetric PA/US images as the input, our deep-learning (DL) model accurately predicted the risk level of fibrosis and steatosis, which indicate the graft quality and transplant suitability, when compared with true pathological scores. DL also identified the imaging modes (PAI wavelength and B-mode USI) that correlated the most with prediction accuracy, without relying on ill-posed spectral unmixing. Our method was evaluated in six discarded human donor livers comprising sixty spatially matched regions of interest. Our study will pave the way for a new standard of care in organ graft quality and transplant suitability that is fast, noninvasive, and spatially thorough to prevent unnecessary organ discards in liver transplantation.

8
Identification, evolutionary history and characteristics of orphan genes in root-knot nematodes

Seckin, E.; Colinet, D.; Bailly-Bechet, M.; Seassau, A.; Bottini, S.; Sarti, E.; Danchin, E. G.

2026-04-11 bioinformatics 10.64898/2025.12.19.695360 medRxiv
Top 6%
0.2%

Orphan genes, lacking homologs in other species, are systematically found across genomes. Their presence may result from extensive divergence from pre-existing genes or from de novo gene birth, which occurs when a gene emerges from a previously non-genic region. In this study, we identified orphan genes in the genomes of globally distributed plant-parasitic nematodes of the genus Meloidogyne and investigated their origins, evolution, and characteristics. Using a comparative genomics framework across 85 nematode species, we found that 18% of Meloidogyne genes are genus-specific, transcriptionally supported orphans. By combining ancestral sequence reconstruction and synteny-based approaches, we inferred that 20% of these orphan genes originated through high divergence, while 18% likely emerged de novo. Proteomic and translatomic evidence confirmed the translation of a subset of these genes, and feature analyses revealed distinctive molecular signatures, including shorter length, signal peptide enrichment, and a tendency for extracellular localization. These findings highlight orphan genes as a substantial and previously underexplored component of the Meloidogyne genome, with potential roles in their worldwide parasitism.

9
Educational Browser-Native SIR Simulation: Analytical Benchmarks Showing Numerical Accuracy for Lightweight Epidemic Modeling

Ben-Joseph, J.

2026-04-17 epidemiology 10.64898/2026.04.15.26350961 medRxiv
Top 9%
0.1%

Lightweight epidemic calculators are widely used for teaching and rapid scenario exploration, yet many omit the methodological detail needed for scientific reuse. We present a browser-native SIR calculator that exposes forward Euler and classical fourth-order Runge-Kutta (RK4) integration alongside epidemiologically interpretable outputs and a population-conservation diagnostic. The implementation is anchored to analytical properties of the deterministic SIR system, including the epidemic threshold, the peak condition, and the final-size relation. Benchmark experiments show that RK4 is essentially step-size invariant over practical discretizations, whereas Euler at a coarse one-day step overestimates peak prevalence by 3.97% and final size by 0.66% relative to a fine-step RK4 reference. These results demonstrate that browser-based tools can support publication-quality computational narratives when solver choice, diagnostics, and assumptions are treated as first-class outputs.
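The solver comparison described in this abstract can be illustrated with a minimal sketch. It is not the preprint's implementation: the parameter values (BETA, GAMMA, I0, the 400-day horizon), all function names, and the choice of Python are assumptions made for illustration, so the numbers it produces will not match the 3.97% and 0.66% figures quoted above. The sketch integrates the deterministic SIR system in population fractions with forward Euler and RK4, tracks a population-conservation diagnostic, and checks both solvers against the analytical peak condition (s = 1/R0) and the final-size relation.

```python
import math

BETA, GAMMA, I0 = 0.3, 0.1, 1e-3   # illustrative values, not taken from the preprint
R0 = BETA / GAMMA                   # epidemic threshold: an outbreak grows only if R0 * s0 > 1


def sir_rhs(s, i):
    """Derivatives (ds/dt, di/dt, dr/dt) of the SIR system in population fractions."""
    new_infections = BETA * s * i
    return (-new_infections, new_infections - GAMMA * i, GAMMA * i)


def step_euler(state, dt):
    """One forward Euler step."""
    d = sir_rhs(state[0], state[1])
    return tuple(x + dt * dx for x, dx in zip(state, d))


def step_rk4(state, dt):
    """One classical fourth-order Runge-Kutta step."""
    s, i, _ = state
    k1 = sir_rhs(s, i)
    k2 = sir_rhs(s + 0.5 * dt * k1[0], i + 0.5 * dt * k1[1])
    k3 = sir_rhs(s + 0.5 * dt * k2[0], i + 0.5 * dt * k2[1])
    k4 = sir_rhs(s + dt * k3[0], i + dt * k3[1])
    return tuple(
        x + dt / 6.0 * (a + 2 * b + 2 * c + d)
        for x, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def simulate(step, dt, t_end=400.0):
    """Integrate from (1 - I0, I0, 0) and report peak prevalence, final size, and S+I+R drift."""
    state = (1.0 - I0, I0, 0.0)
    peak = state[1]
    for _ in range(round(t_end / dt)):
        state = step(state, dt)
        peak = max(peak, state[1])
    return {
        "peak": peak,
        "final_size": state[2],
        "conservation_error": abs(sum(state) - 1.0),
    }


def analytical_final_size(s0=1.0 - I0, iterations=200):
    """Solve the final-size relation z = 1 - s0 * exp(-R0 * z) by fixed-point iteration."""
    z = 0.0
    for _ in range(iterations):
        z = 1.0 - s0 * math.exp(-R0 * z)
    return z


def analytical_peak(s0=1.0 - I0):
    """Peak prevalence from the conserved quantity i + s - ln(s)/R0, evaluated at s = 1/R0."""
    return I0 + s0 - (1.0 + math.log(R0 * s0)) / R0


euler_coarse = simulate(step_euler, dt=1.0)   # coarse one-day Euler step
rk4_fine = simulate(step_rk4, dt=0.1)         # finer RK4 reference
z_star = analytical_final_size()
i_peak = analytical_peak()

for name, run in (("Euler dt=1.0", euler_coarse), ("RK4 dt=0.1", rk4_fine)):
    print(name, run, "final-size error:", run["final_size"] - z_star)
```

Reporting the conservation error next to the epidemiological outputs mirrors the abstract's point that diagnostics belong in the output, since a drift in S + I + R would signal an implementation error regardless of solver.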

10
Imaging Mass Cytometry (IMC) as a Tool to Characterize Circulating Tumor Cells (CTCs) in Preclinical Mouse Models

Pore, M.; Balamurugan, K.; Atkinson, A.; Breen, D.; Mallory, P.; Cardamone, A.; McKennett, L.; Newkirk, C.; Sharan, S.; Bocik, W.; Sterneck, E.

2026-04-16 cancer biology 10.64898/2025.12.18.695262 medRxiv
Top 9%
0.1%

Circulating tumor cells (CTCs), and especially CTC-clusters, are linked to poor prognosis and may reveal mechanisms of metastasis and treatment resistance. Therefore, developing unbiased methods for the functional characterization of CTCs in liquid biopsies is an urgent need. Here, we present an evaluation of multiplex imaging mass cytometry (IMC) to analyze CTCs in mice with human xenograft tumors. In a single-step process, IMC uses metal-labeled antibodies to simultaneously detect a large number of proteins/modifications within minimally manipulated small volumes of blood from the tail vein or heart. We used breast cancer cell lines and a patient-derived xenograft (PDX) to assess antibodies for cross-species interpretation. Along with manual verification, HALO-AI-based cell segmentation was used to identify CTCs and quantify markers. Despite some limitations regarding human-specificity, this technology can be used to investigate the effect of genetic and pharmacological interventions on the properties of single and cluster CTCs in tumor-bearing mice.

11
Monitoring-based and self-reported close-contact records in relation to ultra-wideband-derived proximity in a long-term care facility: a single-facility observational study

Shinto, H.; Chowell, G.; Takayama, Y.; Ohki, Y.; Saito, K.; Mizumoto, K.

2026-04-13 infectious diseases 10.64898/2026.04.10.26350570 medRxiv
Top 10%
0.1%

Background: In long-term care facilities (LTCFs), close-contact identification often relies on staff recall and monitoring records because residents may be unable to self-report reliably. How these different record-generation processes relate to proximity-based sensor measurements in routine LTCF workflow remains unclear, and how such differences may influence contact-based decision-making in outbreak response is not well understood. Methods: We conducted a five-day observational study in a Japanese LTCF using ultra-wideband (UWB) indoor positioning. Twenty-seven participants wore UWB tags, including 16 residents and 11 staff members; 10 staff members completed questionnaires. We compared UWB-derived proximity with questionnaire-derived contacts from staff self-report and monitoring-based proxy records, and assessed directional discrepancies under multiple distance-time thresholds. Results: Questionnaire-based records and UWB-derived proximity showed different patterns of discrepancy across contact types. Within this facility, resident-related monitoring-based proxy records showed relatively small directional discrepancies, whereas staff self-reports tended to identify additional resident-staff contacts under the baseline threshold (≤1.0 m for ≥15 min). Several alternative thresholds were associated with discrepancies closer to zero than the baseline, although the apparent ranking varied by summary metric. Conclusions: In this single-facility observational study, different contact-list generation processes were associated with different patterns of discrepancy relative to a proximity-based operational measure. These findings support interpretation in terms of workflow-specific contact-list generation rather than a single universally optimal threshold and may help inform facility-level review of contact identification practices in LTCFs. These findings support aligning contact identification strategies with facility-specific workflows to improve the feasibility and effectiveness of IPC practices in LTCFs.

12
Noisy periodicity in tropical respiratory disease dynamics

Yang, F.; Hanks, E. M.; Conway, J. M.; Bjornstad, O. N.; Thanh, N. T. L.; Boni, M. F.; Servadio, J. L.

2026-04-13 epidemiology 10.64898/2026.04.10.26350660 medRxiv
Top 10%
0.1%

Infectious disease surveillance systems in tropical countries show that respiratory disease incidence generally manifests as year-round activity with weak fluctuations and irregular seasonality. Previously, using a ten-year time series of influenza-like illness (ILI) collected from outpatient clinics in Ho Chi Minh City (HCMC), Vietnam, we found a combination of nonannual and annual signals driving these dynamics, but with unknown mechanisms. In this study, we use seven stochastic dynamical models incorporating humidity, temperature, and school term to investigate plausible mechanisms behind these annual and nonannual incidence trends. We use iterated filtering to fit the models and evaluate the models by comparing how well they replicate the combination of annual and nonannual signals. We find that a model including specific humidity, temperature, and school term best fits our observed data from HCMC and partially reproduces the irregular seasonality. The estimated effects from specific humidity and temperature on transmission are nonlinearly negative but weak. School dismissal is associated with decreased transmission, but also with low magnitude. Under these weak external drivers, we hypothesize that stochasticity makes a strong sub-annual cycle more likely to be observed in ILI disease dynamics. Our study shows a possible mechanism for respiratory disease dynamics in the tropics. When the external drivers are weak, the seasonality of respiratory disease dynamics is prone to the influence of stochasticity.

13
Clinical Application of CT-Guided Lung Nodule Localization Needles in Preoperative Localization of Small Pulmonary Nodules

Xu, R.; Dou, H.; Zhang, M.; Liu, Z.

2026-04-16 surgery 10.64898/2026.04.13.26350830 medRxiv
Top 13%
0.0%

Background: To investigate the safety and efficacy of CT-guided lung nodule localization needles for the preoperative localization of small pulmonary nodules. Methods: A retrospective study was conducted on 102 patients with a total of 113 small pulmonary nodules who underwent preoperative localization at Jinan Fourth People's Hospital from January 2024 to December 2025. Nodule diameter and depth, localization time, the number of pleural punctures, the localization success rate, and postoperative complications (hook dislodgement, hemorrhage, and pneumothorax) were recorded. All patients underwent video-assisted thoracoscopic surgery (VATS) after localization. Results: The mean nodule diameter was 0.97±0.36 cm, the mean depth was 1.26±0.48 cm, and the mean localization time was 9.8±3.65 minutes. The hook dislodgement rate was 0.98% (1/102), the intrapulmonary hemorrhage rate was 14.71% (15/102), and the pneumothorax rate was 16.67% (17/102). All pulmonary nodules were successfully resected by VATS at 73.82±13.83 minutes after localization, and no severe complications occurred. Conclusions: The use of a CT-guided lung nodule localization needle for the preoperative localization of small pulmonary nodules decreases the time needed for intraoperative nodule detection and operation time. This strategy is a simple, safe, and accurate preoperative localization method that is worthy of increased clinical use.

14
GPR143, a novel immunohistochemical marker for renal tumors with FLCN/TSC/MTOR-TFE alterations

Li, Q.; Singh, A.; Hu, R.; Huang, W.; Shapiro, D. D.; Abel, E. J.; Zong, Y.

2026-04-13 pathology 10.64898/2026.04.06.26350070 medRxiv
Top 13%
0.0%

Although several ancillary tests are available in limited laboratories, diagnosis of microphthalmia (MiT)/TFE family translocation renal cell carcinoma (tRCC) could be challenging due to diverse and overlapping tumor morphology and the lack of reliable biomarkers. GPNMB has been recently identified as a diagnostic marker for various renal neoplasms with FLCN/TSC/mTOR-TFE alterations. However, the sensitivity and specificity of GPNMB immunostain are suboptimal and the result interpretation in ambiguous cases could be difficult. To search for additional biomarkers that could improve the screening sensitivity and predict genetic aberrations in the FLCN/TSC/mTOR-TFE pathway in renal tumors, we performed bioinformatic analysis of publicly available cancer databases and found that GPR143, a transmembrane protein regulated by MiT transcription factors, was highly expressed in a subset of renal cell carcinomas (RCCs). In two The Cancer Genome Atlas (TCGA) kidney cancer cohorts, RCCs with high levels of GPR143 expression were enriched for renal neoplasms with FLCN/TSC/mTOR-TFE alterations. Similar to GPNMB labeling, GPR143 immunostain was positive in the majority of tRCC cases and renal tumors with FLCN/TSC/mTOR alterations, suggesting that GPR143 could function as another surrogate marker for FLCN/TSC/mTOR-TFE alterations in certain renal tumors. Interestingly, despite the concordant GPR143 and GPNMB immunoreactivity in most renal neoplasms with FLCN/TSC/mTOR-TFE alterations, diffuse GPR143 immunostain was observed in some cases with negative or focal GPNMB labeling. Taken together, our results indicate GPR143 could serve as a useful adjunct marker to improve the sensitivity for screening renal tumors with FLCN/TSC/mTOR-TFE alterations.

15
Fine-Tuning PubMedBERT for Hierarchical Condition Category Classification

Wang, X.; Hammarlund, N.; Prosperi, M.; Zhu, Y.; Revere, L.

2026-04-15 health systems and quality improvement 10.64898/2026.04.13.26350814 medRxiv
Top 14%
0.0%

Automating Hierarchical Condition Category (HCC) assignment directly from unstructured electronic health record (EHR) notes remains an important but understudied problem in clinical informatics. We present HCC-Coder, an end-to-end NLP system that maps narrative documentation to 115 Centers for Medicare & Medicaid Services (CMS) HCC codes in a multi-label setting. On the test dataset, HCC-Coder achieves a macro-F1 of 0.779 and a micro-F1 of 0.756, with a macro-sensitivity of 0.819 and macro-specificity of 0.998. By contrast, Generative Pre-trained Transformer (GPT)-4o achieves its highest scores, a macro-F1 of 0.735 and a micro-F1 of 0.708, under five-shot prompting. The fine-tuned model demonstrates consistent absolute improvements of 4%-5% in F1-scores over GPT-4o. To address severe label imbalance, we incorporate inverse-frequency weighting and per-label threshold calibration. These findings suggest that domain-adapted transformers provide more balanced and reliable performance than prompt-based large language models for hierarchical clinical coding and risk adjustment.
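The two imbalance remedies named in this abstract, inverse-frequency weighting and per-label threshold calibration, can be sketched in a few lines. The abstract does not give the exact formulas, so everything here is an assumption for illustration: weights are taken as N divided by each label's positive count and rescaled to mean 1, thresholds are chosen per label by maximizing F1 on a held-out set over a fixed grid, and the toy data and function names are hypothetical rather than taken from HCC-Coder.

```python
def inverse_frequency_weights(y_true):
    """Per-label weights proportional to 1 / positive frequency, rescaled to mean 1 (assumed form)."""
    n_samples, n_labels = len(y_true), len(y_true[0])
    counts = [sum(row[j] for row in y_true) for j in range(n_labels)]
    raw = [n_samples / max(count, 1) for count in counts]  # guard against labels with no positives
    mean = sum(raw) / n_labels
    return [w / mean for w in raw]


def f1_score(y, pred):
    """Binary F1 from 0/1 lists; returns 0.0 when there are no positives at all."""
    tp = sum(1 for a, b in zip(y, pred) if a == 1 and b == 1)
    fp = sum(1 for a, b in zip(y, pred) if a == 0 and b == 1)
    fn = sum(1 for a, b in zip(y, pred) if a == 1 and b == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def calibrate_thresholds(y_true, probs, grid=None):
    """Pick, for each label, the grid threshold with the best validation F1 (lowest wins ties)."""
    grid = grid or [k / 20 for k in range(1, 20)]  # 0.05, 0.10, ..., 0.95
    thresholds = []
    for j in range(len(y_true[0])):
        y = [row[j] for row in y_true]
        p = [row[j] for row in probs]
        thresholds.append(max(grid, key=lambda t: f1_score(y, [int(v >= t) for v in p])))
    return thresholds


# Hypothetical validation set: label 0 is common, label 1 is rare and scored with low probabilities.
y_val = [[1, 0], [1, 0], [1, 0], [1, 0], [0, 0], [0, 1]]
p_val = [[0.9, 0.05], [0.8, 0.10], [0.7, 0.02], [0.6, 0.08], [0.4, 0.12], [0.2, 0.30]]

weights = inverse_frequency_weights(y_val)
thresholds = calibrate_thresholds(y_val, p_val)
print("weights:", weights)
print("thresholds:", thresholds)
```

In this toy set a uniform 0.5 cutoff would never fire for the rare label, while its calibrated threshold does, which is the kind of effect per-label calibration is meant to capture under severe imbalance.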

16
Uncovering the mechanisms of clinically-relevant altered antibiotic responses of Staphylococcus aureus under wound infection-mimetic conditions

Rieger, C. D.; Molaeitabari, A.; Dahms, T. E. S.; El-Halfawy, O. M.

2026-04-17 microbiology 10.64898/2025.12.22.696073 medRxiv
Top 14%
0.0%

Standard in vitro antimicrobial susceptibility testing (AST) using Mueller-Hinton broth (MHB) does not reflect infection-site conditions, and its results often do not correlate with therapeutic outcomes. Here, we compared the antibiotic susceptibility of methicillin-resistant Staphylococcus aureus (MRSA), a common chronic wound pathogen, in simulated wound fluid (SWF) resembling wound exudate versus MHB, revealing discordant AST results across six of nine tested antibiotic classes. The most significant were 128-fold increased resistance to tetracyclines and 256-fold sensitization to β-lactams in SWF. Tetracycline resistance was mediated by MntC, an extracellular manganese-binding protein, whereas β-lactam sensitization was driven by cell envelope remodelling in SWF. Galleria mellonella wound infection results matched the SWF susceptibility phenotypes, suggesting SWF better predicts in vivo wound infection therapeutic outcomes. These comprehensive phenotypic and mechanistic insights into MRSA antibiotic responses under wound-infection-mimetic conditions with direct in vivo validation identify a potential new antibiotic adjuvant target and may guide improved antibiotic therapy for MRSA wound infections.

17
GRASP: Gene-relation adaptive soft prompt for scalable and generalizable gene network inference with large language models

Feng, Y.; Deng, K.; Guan, Y.

2026-04-14 bioinformatics 10.1101/2025.10.20.683485 medRxiv
Top 15%
0.0%

Gene networks (GNs) encode diverse molecular relationships and are central to interpreting cellular function and disease. The heterogeneity of interaction types has led to computational methods specialized for particular network contexts. Large language models (LLMs) offer a unified, language-based formulation of GN inference by leveraging biological knowledge from large-scale text corpora, yet their effectiveness remains sensitive to prompt design. Here, we introduce Gene-Relation Adaptive Soft Prompt (GRASP), a parameter-efficient and trainable framework that conditions inference on each gene pair through only three virtual tokens. Using factorized gene-specific and relation-aware components, GRASP learns to map each pair's biological context into compact soft prompts that combine pair-specific signals with shared interaction patterns. Across diverse GN inference tasks, GRASP consistently outperforms alternative prompting strategies. It also shows a stronger ability to recover unannotated interactions from synthetic negative sets, suggesting its capacity to identify biologically meaningful relationships beyond existing databases. Together, these results establish GRASP as a scalable and generalizable prompting framework for LLM-based GN inference.

18
SARS-CoV-2 Introductions into Lao PDR Revealed by Genomic Surveillance, 2021-2024

Panapruksachat, S.; Troupin, C.; Souksavanh, M.; Keeratipusana, C.; Vongsouvath, M.; Vongphachanh, S.; Vongsouvath, M.; Phommasone, K.; Somlor, S.; Robinson, M. T.; Chookajorn, T.; Kochakarn, T.; Day, N. P.; Mayxay, M.; Letizia, A. G.; Dubot-Peres, A.; Ashley, E. A.; Buchy, P.; Xangsayarath, P.; Batty, E. M.

2026-04-13 epidemiology 10.64898/2026.04.09.26349480 medRxiv
Top 15%
0.0%

We used 2492 whole genome sequences from Laos to investigate the molecular epidemiology of SARS-CoV-2 from 2021 through 2024, covering the major waves of COVID-19 disease in Laos including time periods of travel restrictions and after relaxation of travel across international borders. We identify successive waves of COVID-19 caused by shifts in the dominant lineage, beginning with the Alpha variant in April 2021 and continuing through the Delta and Omicron variants. We quantify a shift from a small number of viral introductions responsible for widespread transmission in early waves to a larger number of introductions for each variant after travel restrictions were lifted, and identify potential routes of introduction into the country. Our study underscores the importance of genomic surveillance to public health responses to characterize viral transmission dynamics during pandemics.

19
Abscess Complications and Prolonged Care in Five-Biomarker-Defined Hypervirulent Klebsiella pneumoniae Bloodstream Infection

Watanabe, N.; Watari, T.; Otsuka, Y.; Matsumiya, T.

2026-04-11 infectious diseases 10.64898/2026.04.10.26350004 medRxiv
Top 15%
0.0%

Background: Five-biomarker-defined hypervirulent Klebsiella pneumoniae (hvKp) causes invasive infections, but its burden in bloodstream infections versus classical K. pneumoniae (cKp) is unclear. Methods: This retrospective cohort study at a tertiary hospital in Japan included K. pneumoniae bloodstream infection episodes from January 2022 to December 2024. hvKp was defined by the presence of all 5 genotypic biomarkers (rmpA, rmpA2, iucA, iroB, and peg-344). The primary outcome was abscess complications, and secondary outcomes were length of stay and antibiotic duration. Whole-genome sequencing was performed for 164 isolates. Results: Among the 207 episodes, 28 (14%) were hvKp. Abscess complications occurred in 17 (61%) hvKp versus 23 (13%) cKp episodes (adjusted odds ratio 10.7; 95% CI, 4.36-26.2). Median length of stay in hvKp versus cKp was 28 versus 14 days (adjusted ratio 1.60; 95% CI, 1.18-2.16) and median antibiotic duration was 43 versus 14 days (adjusted ratio 2.13; 95% CI, 1.64-2.77). These associations were attenuated after adjusting for abscess-related complications. No significant difference in 30-day mortality was observed, although the study was underpowered. Multidrug resistance was less frequent in hvKp strains than in cKp strains (11% vs. 30%; P = .040). Among the sequenced hvKp episodes, abscess rates varied across lineages, from 9 of 10 in ST23 to 1 of 4 in ST412. Conclusions: Five-biomarker-defined hvKp strains delineated a bloodstream infection subgroup with frequent abscess complications and prolonged care. hvKp and cKp present distinct clinical challenges; diagnostic tools distinguishing these subgroups may aid abscess evaluation and source control.
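As a rough arithmetic check on the counts in this abstract, the sketch below computes a crude (unadjusted) odds ratio for abscess complications from the reported 2x2 counts. It assumes the cKp group is the remaining 207 - 28 = 179 episodes, which the abstract does not state directly but which matches the reported 13%. The function name `crude_odds_ratio` is illustrative. The result is not a figure from the study, which reports only the adjusted odds ratio of 10.7.

```python
def crude_odds_ratio(exposed_events, exposed_total, unexposed_events, unexposed_total):
    """Unadjusted odds ratio from a 2x2 table given event counts and group sizes."""
    a = exposed_events                      # exposed, with event
    b = exposed_total - exposed_events      # exposed, without event
    c = unexposed_events                    # unexposed, with event
    d = unexposed_total - unexposed_events  # unexposed, without event
    return (a * d) / (b * c)


# Counts taken from the abstract.
total_episodes = 207
hvkp_total = 28
hvkp_abscess = 17
ckp_abscess = 23

# Assumption: every non-hvKp episode is cKp (not stated explicitly in the abstract).
ckp_total = total_episodes - hvkp_total

crude_or = crude_odds_ratio(hvkp_abscess, hvkp_total, ckp_abscess, ckp_total)
print(f"crude odds ratio: {crude_or:.2f}")
```

The crude value comes out close to the adjusted estimate, which would be consistent with limited confounding by the adjustment variables, although the abstract does not list them.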

20
Modelling serological cross-reactivity to disentangle the dynamics of West Nile and Usutu viruses in an emerging area

Bastard, J.; Migne, C.; Helle, T.; Agneray, E.; Bigeard, C.; Boudjadi, Y.; Chevrier, M.; Dumarest, M.; Gondard, M.; Martin-Latil, S.; Mathews-Martin, L.; Petit, T.; Charpentier, T.; Pouillevet, H.; Durand, B.; Metras, R.; Gonzalez, G.

2026-04-17 epidemiology 10.64898/2026.04.07.26350295 medRxiv
Top 15%
0.0%

Zoos may serve as sentinel sites for zoonotic vector-borne diseases. West Nile virus (WNV) and Usutu virus (USUV) are closely related orthoflaviviruses transmitted between Culex mosquitoes and a bird reservoir. Both viruses can also infect mammals, including humans, where they may cause symptoms and, more rarely, hospitalization and death. However, serological cross-reactivity between WNV and USUV complicates their differential diagnosis. Here, we aimed to reconstruct the dynamics of emergence of WNV in a zoo located in a newly affected area in Europe, using ELISA and Virus Neutralization Test (VNT) serological analysis of 1707 animal sera collected between 2015 and 2024. Combining these data in a model accounting for cross-reactivity with USUV, we estimated yearly forces of infection (FOI) for both viruses, and found that WNV likely circulated in the area one year before the first cases were reported to the passive surveillance system. Our results also showed that, in the zoo, mammals and reptiles had a lower risk of infection than birds (relative risk of 0.14 [0.05; 0.28]), and that the exposure of birds to water (aquatic lifestyle or proximity to stagnant water) affected the risk. Finally, we estimated diagnostic parameters, including the sensitivity of the VNT (80.4% [76.5%; 84.3%]), the expected VNT titer value, and the level of serological cross-reactivity between viruses during the VNT. To conclude, our modelling framework allowed us to disentangle the co-circulation of two closely related viruses, a crucial point in ensuring the reliable sentinel surveillance of these vector-borne zoonotic pathogens.